智能论文笔记

On Decentralizing Federated Reinforcement Learning in Multi-Robot Scenarios

Jayprakash S. Nair , Divya D. Kulkarni , Ajitem Joshi , Sruthy Suresh

分类：机器人 | 人工智能

2022-07-19

联合学习（FL）允许在几个计算设备上进行协作汇总的学习信息，并在其中共享相同的信息，从而解决隐私问题和巨大的带宽。 FL技术通常使用中央服务器或云来汇总从设备接收的模型。这种集中的FL技术遭受了固有的问题，例如中央节点的故障和频道带宽中的瓶颈。当FL与用作设备的连接机器人结合使用时，中央控制实体的故障会导致混乱的情况。本文介绍了基于移动代理的范例，以分散多机器人方案中的FL。我们使用Webot，一种流行的免费开源机器人模拟器和移动代理平台塔塔鲁斯（Tartarus），我们提出了一种方法，以在一组连接的机器人中分散联合学习。通过在不同的连接计算系统上运行的Webot，我们显示移动代理如何执行分散的联合加固学习（DFRL）的任务。通过使用Q-学习和SARSA进行的实验获得的结果，通过汇总其相应的Q表，显示出在机器人技术域中使用分散的FL的生存能力。由于所提出的工作可以与其他学习算法和真正的机器人结合使用，因此它可以用来使用异质学习算法在多机器人方案中同时使用异质学习算法进行研究。

translated by 谷歌翻译

MissMarple : A Novel Socio-inspired Feature-transfer Learning Deep Network for Image Splicing Detection

Angelina L. Gokhale , Dhanya Pramod , Sudeep D. Thepade , Ravi Kulkarni

分类：计算机视觉 | 机器学习

2021-12-15

在本文中，我们提出了一种用于图像剪接检测的新型社会启发卷积神经网络（CNN）深度学习模型。基于从检测到粗略拼接图像区域的前提是可以改善视觉上不可察觉的剪接图像锻炼的检测，所提出的模型称为MissMarple，是涉及特征转移学习的双CNN网络。通过培训和测试所提出的模型，使用哥伦比亚剪接，WildWeb，DSO1和拟议数据集的培训和测试所提出的模型，标题为Abhas，由现实的剪接锻炼组成，揭示了现有深度学习模型的检测精度的提高。

translated by 谷歌翻译

Learning, Computing, and Trustworthiness in Intelligent IoT Environments: Performance-Energy Tradeoffs

Beatriz Soret , Lam D. Nguyen , Jan Seeger , Arne Bröring , Chaouki Ben Issaid , Sumudu Samarakoon , Anis El Gabli , Vivek Kulkarni , Mehdi Bennis , Petar Popovski

分类：人工智能

2021-10-04

智能物联网环境（iiote）由可以协作执行半自动的IOT应用的异构装置，其示例包括高度自动化的制造单元或自主交互收获机器。能量效率是这种边缘环境中的关键，因为它们通常基于由无线和电池运行设备组成的基础设施，例如电子拖拉机，无人机，自动引导车辆（AGV）S和机器人。总能源消耗从多种技术技术汲取贡献，使得能够实现边缘计算和通信，分布式学习以及分布式分区和智能合同。本文提供了本技术的最先进的概述，并说明了它们的功能和性能，特别关注资源，延迟，隐私和能源消耗之间的权衡。最后，本文提供了一种在节能IIOTE和路线图中集成这些能力技术的愿景，以解决开放的研究挑战

translated by 谷歌翻译

A reduced-order modeling framework for simulating signatures of faults in a bladed disk

Divya Shyam Singh , Atul Agrawal , D. Roy Mahapatra

分类：机器学习

2021-08-13

本文报告了旋转轴上刀片磁盘的减少订单建模框架，以模拟故障的振动特征，例如旨在模拟数据驱动机器学习的不同组件中的裂纹。我们采用了子组件的总和分析模型，以更好地了解复杂的动态响应。该框架旨在解决分析和优化故障检测和识别方案时遇到的一些挑战，用于旋转涡轮机械（包括航空引擎）的健康监测。我们通过组合总元件和一维有限元素来对刀片磁盘和轴进行建模，从而导致系统。模拟结果与先前发布的数据非常吻合。我们通过分析刀片中的裂纹及其有效降低刚度近似进行建模。对多种类型的故障进行了建模，包括单个和两阶段刀片的叶片中的裂缝，扇形刀片（FBO）和异物损坏（FOD）。我们已经应用了航空发动机操作加载条件，以模拟在线健康监测的现实情况。所提出的减少阶数模拟框架将在概率信号建模，朝故障签名识别的机器学习以及具有测量振动信号的参数估计中提供应用。

translated by 谷歌翻译

Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation

Tejas D. Kulkarni , Karthik R. Narasimhan , Ardavan Saeedi , Joshua B. Tenenbaum

分类：

2016-04-20

Learning goal-directed behavior in environments with sparse feedback is a major challenge for reinforcement learning algorithms. The primary difficulty arises due to insufficient exploration, resulting in an agent being unable to learn robust value functions. Intrinsically motivated agents can explore new behavior for its own sake rather than to directly solve problems. Such intrinsic behaviors could eventually help the agent solve tasks posed by the environment. We present hierarchical-DQN (h-DQN), a framework to integrate hierarchical value functions, operating at different temporal scales, with intrinsically motivated deep reinforcement learning. A top-level value function learns a policy over intrinsic goals, and a lower-level function learns a policy over atomic actions to satisfy the given goals. h-DQN allows for flexible goal specifications, such as functions over entities and relations. This provides an efficient space for exploration in complicated environments. We demonstrate the strength of our approach on two problems with very sparse, delayed feedback: (1) a complex discrete stochastic decision process, and (2) the classic ATARI game 'Montezuma's Revenge'.

translated by 谷歌翻译

Invalidator: Automated Patch Correctness Assessment via Semantic and Syntactic Reasoning

Thanh Le-Cong , Duc-Minh Luong , Xuan Bach D. Le , David Lo , Nhat-Hoa Tran , Bui Quang-Huy , Quyet-Thang Huynh

分类：机器学习

2023-01-03

In this paper, we propose a novel technique, namely INVALIDATOR, to automatically assess the correctness of APR-generated patches via semantic and syntactic reasoning. INVALIDATOR reasons about program semantic via program invariants while it also captures program syntax via language semantic learned from large code corpus using the pre-trained language model. Given a buggy program and the developer-patched program, INVALIDATOR infers likely invariants on both programs. Then, INVALIDATOR determines that a APR-generated patch overfits if: (1) it violates correct specifications or (2) maintains errors behaviors of the original buggy program. In case our approach fails to determine an overfitting patch based on invariants, INVALIDATOR utilizes a trained model from labeled patches to assess patch correctness based on program syntax. The benefit of INVALIDATOR is three-fold. First, INVALIDATOR is able to leverage both semantic and syntactic reasoning to enhance its discriminant capability. Second, INVALIDATOR does not require new test cases to be generated but instead only relies on the current test suite and uses invariant inference to generalize the behaviors of a program. Third, INVALIDATOR is fully automated. We have conducted our experiments on a dataset of 885 patches generated on real-world programs in Defects4J. Experiment results show that INVALIDATOR correctly classified 79% overfitting patches, accounting for 23% more overfitting patches being detected by the best baseline. INVALIDATOR also substantially outperforms the best baselines by 14% and 19% in terms of Accuracy and F-Measure, respectively.

translated by 谷歌翻译

SIRL: Similarity-based Implicit Representation Learning

Andreea Bobu , Yi Liu , Rohin Shah , Daniel S. Brown , Anca D. Dragan

分类：机器人 | 人工智能 | 机器学习

2023-01-02

When robots learn reward functions using high capacity models that take raw state directly as input, they need to both learn a representation for what matters in the task -- the task ``features" -- as well as how to combine these features into a single objective. If they try to do both at once from input designed to teach the full reward function, it is easy to end up with a representation that contains spurious correlations in the data, which fails to generalize to new settings. Instead, our ultimate goal is to enable robots to identify and isolate the causal features that people actually care about and use when they represent states and behavior. Our idea is that we can tune into this representation by asking users what behaviors they consider similar: behaviors will be similar if the features that matter are similar, even if low-level behavior is different; conversely, behaviors will be different if even one of the features that matter differs. This, in turn, is what enables the robot to disambiguate between what needs to go into the representation versus what is spurious, as well as what aspects of behavior can be compressed together versus not. The notion of learning representations based on similarity has a nice parallel in contrastive learning, a self-supervised representation learning technique that maps visually similar data points to similar embeddings, where similarity is defined by a designer through data augmentation heuristics. By contrast, in order to learn the representations that people use, so we can learn their preferences and objectives, we use their definition of similarity. In simulation as well as in a user study, we show that learning through such similarity queries leads to representations that, while far from perfect, are indeed more generalizable than self-supervised and task-input alternatives.

translated by 谷歌翻译

G-CEALS: Gaussian Cluster Embedding in Autoencoder Latent Space for Tabular Data Representation

Manar D. Samad , Sakib Abrar

分类：机器学习 | 人工智能

2023-01-02

The latent space of autoencoders has been improved for clustering image data by jointly learning a t-distributed embedding with a clustering algorithm inspired by the neighborhood embedding concept proposed for data visualization. However, multivariate tabular data pose different challenges in representation learning than image data, where traditional machine learning is often superior to deep tabular data learning. In this paper, we address the challenges of learning tabular data in contrast to image data and present a novel Gaussian Cluster Embedding in Autoencoder Latent Space (G-CEALS) algorithm by replacing t-distributions with multivariate Gaussian clusters. Unlike current methods, the proposed approach independently defines the Gaussian embedding and the target cluster distribution to accommodate any clustering algorithm in representation learning. A trained G-CEALS model extracts a quality embedding for unseen test data. Based on the embedding clustering accuracy, the average rank of the proposed G-CEALS method is 1.4 (0.7), which is superior to all eight baseline clustering and cluster embedding methods on seven tabular data sets. This paper shows one of the first algorithms to jointly learn embedding and clustering to improve multivariate tabular data representation in downstream clustering.

translated by 谷歌翻译

Skew Class-balanced Re-weighting for Unbiased Scene Graph Generation

Haeyong Kang , Chang D. Yoo

分类：机器学习

2023-01-01

An unbiased scene graph generation (SGG) algorithm referred to as Skew Class-balanced Re-weighting (SCR) is proposed for considering the unbiased predicate prediction caused by the long-tailed distribution. The prior works focus mainly on alleviating the deteriorating performances of the minority predicate predictions, showing drastic dropping recall scores, i.e., losing the majority predicate performances. It has not yet correctly analyzed the trade-off between majority and minority predicate performances in the limited SGG datasets. In this paper, to alleviate the issue, the Skew Class-balanced Re-weighting (SCR) loss function is considered for the unbiased SGG models. Leveraged by the skewness of biased predicate predictions, the SCR estimates the target predicate weight coefficient and then re-weights more to the biased predicates for better trading-off between the majority predicates and the minority ones. Extensive experiments conducted on the standard Visual Genome dataset and Open Image V4 \& V6 show the performances and generality of the SCR with the traditional SGG models.

translated by 谷歌翻译

Lightmorphic Signatures Analysis Toolkit

D. Damian

分类：机器学习

2022-12-31

In this paper we discuss the theory used in the design of an open source lightmorphic signatures analysis toolkit (LSAT). In addition to providing a core functionality, the software package enables specific optimizations with its modular and customizable design. To promote its usage and inspire future contributions, LSAT is publicly available. By using a self-supervised neural network and augmented machine learning algorithms, LSAT provides an easy-to-use interface with ample documentation. The experiments demonstrate that LSAT improves the otherwise tedious and error-prone tasks of translating lightmorphic associated data into usable spectrograms, enhanced with parameter tuning and performance analysis. With the provided mathematical functions, LSAT validates the nonlinearity encountered in the data conversion process while ensuring suitability of the forecasting algorithms.

translated by 谷歌翻译